AD-8174  777  TESTS  CONDITIONAL  ON  IMBALANCE  NITH  BIASED  COIN  DESIGNS  171 
(U)  FLORIDA  STATE  UNIV  TALLAHASSEE  DEPT  OF  STATISTICS 
M  HOLLANDER  ET  AL  JUL  8b  FSU-STAT 1ST ICS-K724 
UNCLASSIFIED  AFOSR-TR-86-2819  F/G  12/1  NL 


AD- A 174  777 


AFOSR-TR.  8  6-  2019 


0! 


TESTS  CONDITIONAL  ON  IMBALANCE  WITH 
BIASED  COIN  DESIGNS 
by 

Myles  Hollander  and  Edsel  Pena 


Department  of  Statistics 
Florida  State  University 
Tallahassee,  Florida  32306-3033 


July,  1986 


FSU  Statistics  Report  No.  M734 
AFOSR  Technical  Report  No.  86-189 


DTIC 

ELECTE 
DEC  8  1986 


B 


Research  supported  by  the  Air  Force  Office  of  Scientific  Research  under 
^  Grant  AFOSR  F49620-85-C-0007  to  the  Florida  State  University. 

O 

o 

Key  Words:  Biased  coin  design;  Conditioning;  Exchangeable;  Markov  chain; 
\  i  J  Randomization  test. 


"DISTRIBUTION  STATEMENT  A 

Approved  for  public  releaee; 
Distribution  Unlimited 


96  tA  09  OAt 


SECURITY  CLASSIFICATION  OF  THIS  FACE 


1 


I  !  ’I 
**#■ 


iv! 


3 


51 


*!v 


Vl 


( 


; 


ft 


1*.  REPORT  SECURITY  CLASSIFICATION 

UNCLASSIFIED 


REPORT  DOCUMENTATION  PAGE 


lb.  RESTRICTIVE  MARKINGS 


2a  SECURITY  CLASSIFICATION  AUTHORITY 

NA 


2b.  OCCLASSIFICATION/DOWNGRAOING  schedule 

NA 


3.  oiStribution/availasility  of  report 

Approved  for  Public  Release;  Distribution 
Unlimited 


4.  PERFORMING  ORGANIZATION  REPORT  NUMBERIS) 

FSU  Statistics  Report  M734 


S.  MONITORING  ORGANIZATION  REPORT  NUMBERIS) 

_ AFOSRTR. 


6a  NAME  of  PERFORMING  ORGANIZATION 

Florida  State  University 


bk  OFFICE  SYMBOL  7a  NAME  OF  MONITORING  ORGANIZATION 

at 

AFOSR-10 


6a  ADDRESS  (City.  Situ  tmt  ZIP  Cut* I 

Department  of  Statistics 
Tallahassee,  FL  32306-3033 


7b.  AOORESS  (City.  Situ  mnt  ZIP  C 04*1 

Bldg.  410 

Bolling  Air  Force  Base,  DC  20332-6448 


84.  NAME  of  FUNOING/SPONSORING 
ORGANIZATION 


AFOSR 


Be.  ADDRESS  (City.  Suit  tut  ZIP  Cott) 


B.  PROCUREMENT  INSTRUMENT  IDENTIFICATION  NUMBER 


F49620-85-C-0007 


70.  SOURCE  OF  FUNDING  NOS. 


PROGRAM 

Bldg.  410  element  NO. 

Bolling  AFB,  DC  20332-6448 _  6.1102F 

II.  TITLE  Unciutt  Security  Clttlifictlionl  — 

TESTS  CONDITIONAL  ON  IMBALANCE  WITH  BIASED  COIN  DESIGNS 


6.1102F 


PROJECT 

task 

NO. 

NO. 

2304 

45 

WORK  UNIT 
NO. 


1 12.  PERSONAL  AUTHORtS) 

1  Myles  Hollander  and  Edsel  Pena 

u  J  "  1 

13a  TYPE  OF  REPORT 

Technical 

130k  TIME  COVERED 

FROM  TO 

'var.Twryw— 

■  iriTiPMi 

IS.  PAGE  COUNT 

20 

IS.  SUBJECT  TERMS  (Com  art  mi  om  iwnrrbt  it  ntttmtry  tmd  ittntify  by  blotb  nmmbtn 

Biased  coin  design;  Conditioning;  Exchangeable;  Markov 
chain;  Randomization  test. 


10.  ABSTRACT  (Continnt  on  iwvtnt  it  mwify  iM  ittntify  by  btneb  nmmbtn 


Distributional  properties  of  the  treatment  assignment  variables  T^,  ...,  Ti  under 
i’s  (1971)  biased  coin  design  are  derived.  These  properties  are  conditional  on  the 


Efron’s  (1971)  biased  coin  design  are  derived.  These  properties  are  conditional  on  the 
terminal  imbalance  of  the  treatment  allocation.  /Recursive  procedures  are  presented  for 
obtaining  the  conditional  moments  of  ....  T^.  Based  on  these  results,  large-sample 
test  statistics  are  proposed  for  the  randomization  test  of  the  null  hypothesis  of  no 
treatment  difference.  In  contrast  to  Efron's  test  statistic,  the  approximations  herein 
proposed  are  applicable  when  there  is  a  treatment  allocation  imbalance.  }  j-y.-*  . 

/  ,,  ,  ,  .-/  .  ..  i4V'  r*  ■"  '  >,  -  J  :.  —  .  P  4.  -A'.  •*  "-4.-M  JL4 


l  V-'.  •*  '-a-vi  iA  ) 


.J  '  - 


20.  OiSTRIBUTION/AVAILABILITY  of  abstract 
UNCLASSIFIEO/UNLIMITIoO  SAME  as  RPT.  □  OTIC  USERS  O 


23a  NAME  of  RESPONSIBLE  INOIVIOUAL 


21.  ABSTRACT  SECURITY  CLASSIFICATION 

UNCLASSIFIED 


~TTrrr\GtA 


DO  FORM  1473. 83  APR 


m 


irJ 


edition  of  1  JAN  73  IS  OBSOLETE. 


23c.  OFFICE  SYMBOL 

AF0SR/NM 


UNCLASSIFIED 


smm 


SECURITY  CLASSIFICATION  OF  This  PAC 

as 


1.  INTRODUCTION 


Consider  a  clinical  trial  to  compare  the  efficacies  of  treatments  A  and  B. 

It  is  typical  in  these  trials  that  patient  arrival  and  treatment  assignment  is 
sequential.  To  obtain  a  balanced  or  nearly  balanced  treatment  allocation, 
without  increasing  unacceptably  experimental  biases,  Efron  (1971)  proposed  a 
biased  coin  treatment  allocation.  Let  T^,  T£,  ...  denote  the  assignment  variables 
with  L  =  0  or  1  depending  on  whether  the  i*h  patient  receives  treatment  A  or  B, 
respectively.  With  Dg  =  0,  define  for  i  =  1,  2,  ... 

i 

Di  *  2  l  T  -  i.  (1.1) 

j  =  l  3 

Under  Efron's  (1971)  biased  coin  design  with  bias  p(%<p<  1),  hereon  abbreviated 
BCD(p),  the  probability  that  the  (i  +  l)**1  patient  receives  B  is  p,  %,  q  =  1-p 
according  to  whether  <,  =,  >  0,  respectively.  Wei  (1977,  1978)  generalized 
this  design  by  allowing  the  (i +  i)*h  patient's  assignment  probabilities  to  depend 
on  i  and  D^. 

We  confine  our  attention  to  a  clinical  trial  where  the  treatments  are  assigned 
to  n  patients  via  the  BCD(p) .  In  Section  2  distributional  properties  of  T^,  ...,  Tn, 
conditional  on  Dn,  are  presented.  Procedures  for  computing  approximations  to  the 
conditional  moments  of  Tj,  ...,  Tr  are  given  in  Section  3.  In  Section  4  we  discuss 
the  relevance  of  the  results  in  Sections  2  and  3  in  relation  to  the  randomization 
test  of  Hq,  the  null  hypothesis  of  no  treatment  difference.  We  consider  test 
statistics  of  the  form 


Sn  * 


n 

l 

i*l 


a..  T. 


(1.2) 


where  a^,  ...,  a^  is  a  nonrandom  sequence  of  scores  associated  with  the  sequence 
of  patient  responses  x^,  ....  xR.  The  two-sided  version  of  the  randomization  test 
thus  rejects  Hq  if  the  observed  SR  is  either  "too  small"  or  "too  large". 


1 


In  deciding  whether  a  test  statistic  is  too  small  or  too  large,  Cox  (1982) 
recommended  that  the  reference  randomization  distribution  should  be  taken  over 


those  treatment  allocations  with  the  same  or  nearly  the  same  terminal  imbalance 
as  the  observed  allocation.  In  accordance  with  Cox's  suggestion,  the  results  of 
Sections  2  and  3  are  conditional  on  Dn.  Note  however  that  some  authors  use  the 
unconditional  randomization  distribution  of  Sn(see  Smythe  and  Wei  (1983)  and 
Smith  (1984)). 

In  his  doctoral  dissertation,  E.  Pefia  derived  a  recursion  procedure  for 
obtaining  the  exact  randomization  distribution  of  Sn,  conditional  on  Dn  =  m.  This 
procedure  enables  one  to  perform  exact  significance  tests  of  Hq,  but  for  large 
sample  sizes  the  computer  time  required  to  implement  the  recursion  may  not  be 
acceptable.  In  Section  4  we  therefore  suggest  an  alternative  test  procedure  based 
on  approximating  the  distribution  of  Sn  by  the  normal  distribution.  In  Section  5 
a  computer  simulation  is  used  to  assess  the  adequacy  of  the  normal  approximation 
for  n  ■  51  and  n  *  101. 

Efron  (1971)  presented  approximations  to  the  conditional  parameters  of  Sn,  and 
suggested  approximating  the  conditional  randomization  distribution  of  S^  by  the 
normal  distribution.  However,  in  a  simulation  study,  Halpem  and  Brown  (1986) 
indicated  that  Efron's  approximations  are  inadequate  in  the  presence  of  treatment 
allocation  imbalance.  The  results  in  Section  2  offer  a  theoretical  explanation 
for  this  inadequacy,  and  in  conjunction  with  the  results  of  Section  3  provide 
improved  approximations  to  the  conditional  moments  of  Sn  whenever  Dn  *  0. 

2.  DISTRIBUTIONAL  PROPERTIES  OF  Tj,  ...,  TN 

Let  Z  be  the  set  of  integers,  Z+  the  set  of  positive  integers,  and  set 
Z^  *  Z+u{0}.  By  the  defining  property  of  the  BCD(p) ,  the  process  Dq,  D^, 
in  (1.1)  is  a  homogeneous  Markov  chain  with  state  space  Z  having  stationary 

Ml 


m 

(Ak 


2 


transition  probabilities 


Pi,j  “  PrCD1=j I Dg=i) »  j  e  Z,  i  c  Z 

h  if  j  -  *i,  i  -  o 

p  if  (j=i+l,i<0)  or  (j=i-l,i>0) 
q  if  (j*i-l,i<0)  or  (j=i+l,i>0) 
0  otherwise. 


(2.1) 


The  nth  order  transition  probabilities  of  this  chain  will  be  denoted  by 

(P1?  . ,  ieZ,  jeZ},  and  the  stationary  distribution  probabilities  by  (it.,  jeZ). 
^ » J  J 

These  stationary  probabilities  are 


*0  *  (p-q)/(2p)  and  it..  =  *  (p-q)/(4p2) (q/p)^'1,  j  e  Z+  .  (2.2) 


In  order  to  maintain  continuity  in  the  presentation  of  the  results,  all  proofs 
are  deferred  to  Section  6. 

Theorem  2.1:  If  m  e  Z  with  P«  „  >  0,  then  conditional  on  D  «  m, 

-  u,m  n 

(i)  T2i_i  and  T2i  are  exchangeable,  and 

(ii)  T^  and  T2^+1  aie  exchangeable  whenever  m  =  0,  for 
i  *  1,  ....  t11/ 2] ,  where  [k]  denotes  the  greatest 
integer  s  k. 

d  d  d 

Let  V  and  W  be  random  variables.  Recall  that  V(s,=,£)W  if  for  every  real 
number  v,  pr(Vsv) (£,a,s)  pr(tfsv) ,  respectively. 


Corollary  2.1:  If  m  €  Z  with  Pq  m 
T2i-2»  T2i-1 ’  T2i’  T2i+1*  V 


>  0,  then  conditional  on  Dn  =  m,  (Tj, 
»  (Tx,  ...,  T2i_2,  T2i,  T2i_x,  T2i+1, 


•**»  Tn^ 


[n/2) . 


for  every  i  *  1, 


9 


On  the  otherhand,  it  is  not  true  that,  conditional  on  D  =0, 

(T1>  •••»  T2i-1*  T2i*  T2i+1*  T2i+2*  *•*»  V  =  (T1»  ***'  T2i-1»  T2i+1»  T2i» 

T2i+2*  Tn^  Eor  i  *  1»  •••»  tn/23  ^  the  following  example  illustrates. 


Example  2.1:  Ltader  the  BCD(p),  pr(Tj=l,  T2=0,  T3=l,  T4=0tD0=0,  D4=0)  * 

prcr^i.iyi,  t3=o,  t4=o|d0=o,  d4=o). 

However,  aside  from  Theorem  2.1  (ii) ,  we  have  the  following  results  for  the 
pairs  (T2i,  T2i+1),  i  =  1,  In/2] . 


Theorem  2.2:  If  m  e  Z  with  p!J  _  >  0  then,  conditional  on  D  =  m.  T„.  . 

-  0,m  *  n  *  2iv  *  *  *  2x+l 

according  to  whether  m(>,=,<)  0,  respectively,  for  i  =  1,  ...,  p/2]. 


Corollary  2.1  implies  that  the  randomization  distribution  of  Sn,  conditional 
on  Dr  =  m,  is  invariant  with  respect  to  permutations  of  members  in  the  doubletons 
{a2i  1,a2i^  for  *  *  1»  ••*»  [n/2] •  Thus  there  are  at  most  II  C(n-2i,2)  permutations 
of  alf  ...,  an  that  give  distinct  conditional  randomization  distributions  for  Sn, 
where  n  represents  the  product  from  i  =  0  to  i  =  [°/2]  and  C(b,a)  denotes  the 
number  of  combinations  of  a  objects  taken  from  b  distinct  objects.  Unfortunately, 
this  bound  increases  rapidly  with  n,  making  it  impractical  to  construct  tables  of 
critical  values  for  the  conditional  randomization  distribution  of  Sn. 

That  the  conditional  randomization  distribution  of  Sn  depends  on  the  particular 
order  of  a^,  ...,  an  under  the  BCD(p)  is  in  contrast  with  the  case  where  treat¬ 
ments  are  assigned  via  complete  randomization,  or  equivalently  via  BCD(^) .  In  the 
latter  situation,  T^  ...,  Tn  are  exchangeable,  conditional  on  Dn  =  m,  and  hence  the 
conditional  randomization  distribution  of  Sn  is  invariant  with  respect  to  permutations 


of 


al  >  •  •  • » 

Let  =  E(TilD0=0,Dn=m)  for  i  =  1, 


n  and  o. . 

ij,m 


for  i,j  *  1,  ...,n.  An  immediate  consequence  of  Theorems  2.1  and  2.2  is: 


4 


:■(»,! 

■ft 


i'.'i 

*»V 

$ 

$■ 


41 


& 

$ 


i*y 

V; 

[*% 

*IV 

v! 


!vl 


i 


I 


**i| 


9 


XI 


:S 

.•A 

* 


SS: 

$1 


I 


Corollary  2.2:  If  m  c  Z  with  PA  _  >  0,  then  for  i  =  1,  ....  P/2], 

v  j  in 


Ci)  4-i, ■  *  4,o  •  and 

Cii)  Uji.m  (s,-.2)  4.1, m  if  *(’>=, <)<>.  respectively. 


Furthermore,  since 


pn  =  Pn_1  /  (Pn_1  +  P11"1  ) 
ul,m  *l,m'  ltl,m  -1,«J* 


(2.3) 


then  p"  m  (>,=,<)%  according  to  whether  m(>,=,<)0.  By  Corollary  2.2  and  using 


the  relation  a1}.  =  p11  fl-p?  )  it  follows  that: 

li, m  i,m  i,m 


Corollary  2.3:  If  m  e  Z  with  Pq  m  >  °*  then  for  i  =  1,  ...,  n. 


(i)  p.  (>,=,<)%  if  m(>,=,<)0,  respectively,  and 
x 


(ii)  o^  m(=,<)J«  if  m(=,*)0,  respectively. 


With  the  object  of  evaluating  the  conditional  mean  of  Sn,  it  is  of  interest 


to  know  if  the  p?  m's  can  be  well-approximated  by  %{l+(m/n)},  the  latter  being  the 
conditional  mean  of  each  under  complete  randomization.  To  resolve  this  question, 
we  resort  to  limiting  values.  Since  DQ,  Dj,  ...  is  a  Markov  chain  of  period  2, 


then  Pg^m  411(1  ^"im-l  converSe  to  2l,2m  40(1  2ir2m-l’  resPectively*  Using  (2.2) 


and  (2.3)  it  follows  that  p"  m  converges  to  \  as  n  tends  to  infinity.  By  a 


similar  argument,  p“  m  can  be  shown  to  converge  to  p,  %  or  q  depending  on  whether 


m  >,  =,  or  <  0,  respectively.  Therefore,  if  m  *  0,  the  absolute  difference  between 


pjj  m  and  Pj  m  tends  to  p  -  k  as  n  increases.  Consequently,  it  is  our  view  that 
approximating  the  p?  m's  by  %(1 +  (ra/n)  )  is  not  advisable.  The  inadequacy  of 
these  approximations  is  further  illustrated  by  results  of  the  simulation  in  Section 


•'vsx* 


3.  CONDITIONAL  MEANS  AND  COVARIANCES 


In  this  section  we  present  recursive  procedures  for  computing  the  conditional 
means  and  covariances  of  T^,  . ..,  Tn>  For  ease  of  implementation,  but  at  the  cost 
of  obtaining  only  approximate  results,  we  follow  Efron  (1971)  and  assume  that 
the  process  DQ,  D^  ...  is  at  its  stationary  state.  The  results  below  thus  pertain 
to  the  process  DQ,  D^,  ...  satisfying 

Pr(D0=j)  =  *y  j  «  z,  (3.1) 

where  (ir  jeZ)  are  given  in  (2.2). 


Theorem  3.1:  If  (3.1)  is  satisfied,  then  for  m  e  Z 


(i)  yi,m  =  ul‘m+1’  1  =  •••>  ^  n  €  Z+» 


n  n-i+1 

(n)  a... 


. .  .  -a..  . 

iz+j,m  ll+j,m 


,  j  »  1,  ...,  n  •  i;  i=l,  ...,n-l;  n  e  Z+. 


Theorem  3.1  implies  that  to  compute  y?  m  for  i  =  1,  ...,  n,  one  must  simply 


k 


obtain  Uj  m  for  k  =  1,  ...,  n;  and,  similarly,  for  the  covariances.  The  following 

theorems  provide  methods  for  obtaining  y,  and  a,,  .  for  k  =  1,  ...,  n. 
r  °  l,m  ll+j,m 


Theorem  3.2;  If  (3.1)  is  satisfied,  then 

k  -1  k 

y,  =  ir  L 
l,m  m  ra 

where  (L^)  satisfy  the  recursion  equation 


Lk  =  y  .  Lk"l  +  (1-y  ,)Lk'J  ,  k  =  2,  3,  ... 
m  'm-l  m-1  v  Tra+r  ra+1  *  *  * 

with  initial  and  boundary  conditions  l*  =  ir  ,  y  , ,  m  e  Z,  and  where  y  =  p,  Jg,  q 

'  m  m-l  'm-l*  *  m  r*  n 

according  to  whether  ra(<,=,>)0,  respectively. 


Theorem  3.3;  If  (3.1)  is  satisfied,  then 


E(TiW  J  -  »*  •••• "  - 1 


where  (L*)  satisfy  the  recursion  equation 

Lm  =  Ym-1  Lm-1  +  ^m+l5  Lm+1»  k  =  2  +  J»3  +  J»  ••• 

with  initial  and  boundary  conditions  L1+-*  =  y  ,  tt  ,,  m  e  Z. 

m  m- 1  m- 1  l ,  m-  JL  ’ 

The  conditional  covariances  are  then  obtained  through  the  relation 

"U.j,.  ■  ECVujlDk'")  - 


and  Theorem  3.1. 

To  illustrate  the  adequacy  of  these  approximations  for  a  moderate  sample  size, 

the  exact  and  approximate  conditional  means  of  T^  ...,  Tn  for  n  =  51,  p  *  z/$  and 

m  =  1,  3  and  5  were  computed.  Exact  values  were  obtained  using  procedures  in 

Pena's  dissertation.  The  results  are  summarized  in  Table  1.  Notice  that  for 

practical  purposes  it  suffices  to  use  the  approximate  means  (and  variances)  for 

this  value  of  n  and  p.  The  simulation  results  in  the  next  section  indicate  however 

that  when  the  approximate  covariances  are  used  to  compute  the  standard  deviation 

of  S„,  this  could  lead  to  overestimation  of  the  exact  standard  deviation  of  S_ 
n*  n 

for  some  score  sequences  a^,  ...,  an. 


4.  LARGE-SAMPLE  TEST  PROCEDURES 


With  a^  =  x^  -  x  in  (1.2)  Efron  (1971)  standardized  Sn  as 

Z1 - ^ - 


(’■*  l  *1*11  aiVij) 
i=l  i*j  1  J 


\ 


(4.1) 


7 


Table  3.1.  Summary  of  exact  and  approximate  conditional 

means  y?  for  n  =  51,  p  *  \  and  m  =  1,  3  and  5 
i,m  r  3 


3 

m 

' 

Exact  Approximate 

Exact  Approximate 

Exact  Approximate 

! 

1 

.5002 

.5002 

.5008 

.5009 

.5024 

.5027 

3 

.5002 

.5002 

.5011 

.5011 

.5031 

.5032 

5 

.5002 

.5002 

.5013 

.5013 

.5037 

.5038 

MM 

.5003 

.5003 

.5016 

.5016 

.5045 

.5044 

1 

.S003 

.5003 

.5019 

.5018 

.5053 

.5052 

ii 

.5004 

.5004 

.5022 

.5022 

.5063 

.5062 

13 

.5005 

.5005 

.5027 

.5026 

.5075 

.5073 

15 

.5006 

.5006 

.5032 

.5031 

.5089 

.5086 

17 

.5007 

.5007 

.5038 

.5037 

.5106 

.5103 

19 

•  S008 

.5008 

.5046 

.5044 

.5126 

.5122 

21 

.5010 

.5010 

.5055 

.5053 

.5150 

.5146 

23 

.5012 

.5012 

.5066 

.5065 

.5179 

.5174 

25 

.5015 

.5015 

.5080 

.5078 

.5214 

.5209 

27 

.5018 

.5018 

.5098 

.5096 

.5257 

.5251 

29 

.5023 

.5022 

.5120 

.5117 

.5309 

.5302 

31 

.5028 

.5028 

.5147 

.5145 

.5373 

.5366 

33 

.5035 

.5035 

.5182 

.5180 

.5451 

.5444 

35 

.5045 

.5044 

.5228 

.5225 

.5548 

.5540 

37 

.5058 

.5057 

.5288 

.5285 

.5668 

.5659 

39 

.5075 

.5075 

.5367 

.5364 

.5817 

.5806 

41 

.5101 

.5100 

.5476 

.5472 

.5999 

.5987 

43 

.5139 

.5138 

.5628 

.5624 

.6217 

.6203 

45 

.5200 

.5199 

.5848 

.5844 

.6462 

.6447 

47 

.5309 

.5309 

.6178 

.6173 

.6683 

.  6667 

49 

.5556 

.5556 

.6672 

.6667 

.6683 

.6667 

51 

.6668 

.6667 

.6672 

.6667 

.6683 

.6667 

where  =  cov(T^,T\)  denotes  the  unconditional  covariance  between  T\  and  T\. 

He  suggested  approximating  the  conditional  randomization  distribution  of  2^  by 

the  standard  normal  distribution.  Efron  assumed  that  there  was  complete  balance 

in  the  treatment  allocation,  that  is,  DR  =  0.  In  that  case  has  conditional 

mean  0  but  its  conditional  variance  may  not  be  1.0  since  in  the  denominator  of 

(4.1)  o. .  was  used  instead  of  ck1.  Furthermore,  if  D  =  m  *  0,  Corollary  2.3 
lj  13 ,m  n 

shows  that  has  a  nonzero  conditional  mean  and  a  conditional  variance  that  will 
tend  to  be  smaller  than  1.0.  Thus,  assuming  that  the  distribution  of  is 
normal,  the  standard  normal  approximation  may  go  awry.  This  offers  a  theoretical 
explanation  for  the  simulation-based  conclusion  of  Halpem  and  Brown  (1986)  that 
the  standard  normal  approximation  to  Zj  is  inadequate  in  the  presence  of  treatment 
imbalance. 

With  the  ability  to  compute  the  conditional  means  and  covariances  of  the  T*s 
using  results  of  Section  3,  we  propose  the  use  of  the  test  statistic 


l  a. (T. -u"  J 

{  l  a?o*.  +Ha.a.on. 
i=l  1  11»m  i4  1  3  13 


(4.2) 


where  the  a^,  ...,  an  are  standardized  to  satisfy  £  a.  =0.  If  the  standard 

i=l 

normal  distribution  is  then  used  to  approximate  the  distribution  of  Z2,  the 
a- level  two-sided  randomization  test  rejects  HQ  if  jz2|  >  z0/2»  w^ere  za/2  *s 
the  (1-  o/2) 100%  percentile  of  the  standard  normal  distribution. 

Efron  (1971)  showed  that  when  the  sequence  a^,  ...,  an  is  noisy,  the  covariance 
term  in  (4.1)  given  by 


K  =  7  T  a.  a.  a.  . 

M  1  j 


(4.3) 


is  near  0.  We  expect  the  covariance  term  in  (4.2)  given  by 


(4.4) 


i  *  j 


a.  a. 
i  J 


n 
a.  . 
10,111 


to  have  roughly  the  same  behaviour  as  K.  If  the  score  sequence  is  noisy  we  thus 
prefer  to  use  the  computationally  simpler  test  statistic 

n 


n  0  . 

r  V  n 
{  )  a.  a. .  } 

i“l  i  n, m 


rather  than  Zy 

We  are  not  able  to  provide  a  theoretical  justification  for  the  standard 

normal  approximation  to  the  conditional  randomization  distribution  of  Z^,  Z^, 

and  Zj.  This  problem  of  proving  that  the  conditional  randomization  distributions 

converge  in  distribution  to  the  normal  distribution  is  a  difficult  theoretical 

problem,  moreso  under  the  BCD(p) .  Under  Wei's  (1978)  adaptive  biased  coin  design, 

Wei,  Smythe  and  Smith  (1986)  approached  this  problem  by  proving  the  joint  asymptotic 

normality  of  (sn»Dn) .  and  then  conjecturing  that  Sn,  conditional  on  Dr,  is 

asymptotically  normal.  However,  Smythe  and  Wei  (1983)  pointed  out  that  the  uncon- 

n 

ditional  distribution  of  Sn  =  \  a^  under  the  BCD(p)  may  not  be  asymptotically 

i=l 

normal.  The  approach  of  Wei,  Smythe,  and  Smith  may  not  therefore  work  for  our 
problem  since  (Sn,Dn)  may  not  be  jointly  asymptotically  normal. 

A  computer  simulation  was  performed  to  examine  the  adequacy  of  the  normal 
approximation  to  the  conditional  randomization  distributions  of  Z^,  Z^,  and  Zy 
Details  of  this  study  are  discussed  in  the  next  section. 


5.  A  SIMULATION  STUDY 


The  simulation  was  performed  at  the  Florida  State  University  Computing  Center 
on  a  Control  Data  Cyber  730  computer.  The  uniform  random  number  generator  used 
was  the  intrinsic  routine  RANF. 
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N 


Two  sample  sizes,  n  ■  51  and  n  *  101,  were  considered  in  the  simulation,  and 
the  scores  in  (1.2)  were  set  to  =  rank  (x^)  -  (n+l)/2.  The  covariance  quantity 
K  in  (4.3)  was  set  to  0.  For  each  sample  size,  five  score  sequences  were  considered, 
and  as  a  measure  of  the  noise  of  each  sequence  their  respective  correlation  coeffi¬ 
cients  with  1,  2,  ...,  n  were  computed.  A  low  absolute  correlation  coefficient 
indicates  a  high  degree  of  noise.  These  correlation  coefficients  are  summarized 
in  Table  2. 


Table  2.  Correlation  coefficients  for  the  10  rank  sequences 
in  the  simulation  study. 


Sample 
Size,  n 

Sequence  Number 

1 

2 

3 

4 

5 

51 

0.0724 

0.1919 

0.3719 

0.7252 

1.00 

101 

-0.1988 

0.0071 

0.3244 

0.6827 

1.00 

For  each  score  sequence,  10000  replicates  of  assignment  variates  t^,  . . . ,  tn 
were  generated  via  the  BCD(p)  with  p  =  %.  For  each  of  these  replicates,  the 
values  of  Z^,  Z2,  and  Zj  were  computed,  with  the  conditional  means  and  covariances 
of  Tt,  ...»  Tn  approximated  by  the  procedures  in  Section  3.  After  stratifying  the 
10000  replicates  according  to  their  value  of  Dn  *  m,  the  mean,  standard  deviation, 
percentages  of  values  greater  than  1.645,  1.96,  and  2.33  of  Z^,  Z2,  and  Zj  were 
determined.  The  results  are  summarized  in  Table  3  for  n  =  51,  and  in  Table  4  for 
n  ■  101.  Those  values  that  are  superscripted  by  one  or  two  asterisks  are  signi¬ 
ficantly  different,  based  on  the  classical  Z-test,  from  what  is  expected  under 
the  standard  normal  distribution  at  level  0.05  or  0.01,  respectively. 

Tables  3  and  4  indicate  that  the  adequacy  of  the  standard  normal  approximation 
depends  on  the  degree  of  noise  of  the  score  sequence.  The  more  noise  there  is  in 
the  sequence,  the  better  the  approximation.  Notice  that  for  sequences  1  and  2 
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which  are  the  noisiest  sequences,  the  percentages  of  values  of  Zj,  Z2  and  Z 3 
greater  than  1.645,  1.96,  and  2.33,  for  most  values  of  m,  are  statistically 
consistent  with  the  5.0,  2.5,  and  1.0  percent  that  could  be  expected  under  the 
standard  normal. 

For  almost  all  cases,  the  means  of  Z2  and  Z^  are  statistically  consistent  with 

0;  in  contrast,  the  means  of  Z^  are,  for  most  cases,  statistically  different  from  0. 

This  supports  our  earlier  observations  that  the  approximations  to  the  u”  m's 

in  Section  3  are  good,  and  that  %{l+(m/n)>  should  not  be  substituted  for  the 

yV  's.  Note  also  that  the  departure  from  0  of  the  mean  of  Z.  increases  as  the 

l  ,m  i 

treatment  allocation  imbalance  increases.  Aside  from  that,  as  n  increases,  the 
mean  of  Z^  tends  to  increase  or  decrease  according  to  whether  the  correlation  is 
positive  or  negative,  respectively. 

The  behaviours  of  Z2  and  Z^  for  sequences  1  and  2  are  almost  identical, 
implying  that  the  covariance  term  K"  in  (4.4)  is  near  0.  Their  standard  deviations 
become  more  different  however,  as  we  go  along  sequences  3,  4  and  5  for  both  values 
of  n.  This  implies  that  when  the  score  sequence  is  noisy,  the  covariance  term 
K"  should  not  be  ignored,  and  Z2  should  be  the  preferred  test  statistic.  On  the 
otherhand,  the  standard  deviations  of  Z^  and  Z^  are  almost  identical  for  all  cases. 

We  expect  Z2's  standard  deviation  to  be  1.0;  however,  the  observed  standard 
deviations  of  Z2  for  sequences  4  and  5  are  significantly  lower  than  1.0.  The 
increase  in  sample  size  from  n  =  51  to  n  =  101  did  not  improve  the  approximation. 

This  indicates  that  the  approximations  to  the  covariances  given  in  Section  3  lead 
to  an  overestimate  of  K"  in  (4.4).  Consequently,  if  Z2  is  computed  using  the  approxi¬ 
mations  in  Section  3,  and  the  standard  normal  is  used  to  approximate  the  distribution 
of  Z2,  the  randomization  test  which  rejects  HQ  if  |Z2|  >  za^2  will  tend  to  be  a 
conservative  test  for  non-noisy  score  sequences. 


‘Significantly  different  from  what  is  expected  under  the  standard  normal  it  o  •  0.05. 
•Significantly  different  from  uhat  is  exepcted  under  the  standard  normal  at  o  •  0.01. 


'Significantly  different  froa  what  is  expected  under  the  standard  normal  at  o  *  0.0S. 
'Significantly  different  froa  what  is  expected  under  the  standard  noraal  at  o  ■  0.01. 


6.  PROOFS 


Before  proving  the  theorems  of  Sections  2  and  3  we  state  the  following  results 
which  were  proved  in  Pena's  dissertation. 

Lemma  6.1;  The  process  DQ,  ...  is  symmetric  in  the  sense  that 


for  every  d0»  •••»  dn  with  d^  €  Z. 

Corollary  6.1:  P1?  ..  =  Pn.  ,  i  e  2,  j  e  Z,  n  e  Z®  . 

Lemma  6.2:  If  x,y  c  {0,1}  and  j,  k  e  Z,  then 

(i)  a(x,y,j,k)  =  0  whenever  j  and  k  are  even, 

(ii)  a(x,y,j,k)  *  0  whenever  j  and  k  are  odd  and  of  opposite  signs,  and 

Ciii)  a(x,y,j,k)  *  a(l-x,l-y,-j ,-k) , 

where  a(x,y,j,k)  =  pr(T1=x,T2=y|D()=j ,D2=k)  -  pr(Tj=y,T2=x| DQ=j ,D2=k) . 


Proof  of  Theorem  2.1:  Let  x  and  y  take  arbitrary  values  in  {0,1}.  To  prove  (i) , 
note  that  pr(T2i_j=x,T2i=y|D0=>0,Dn=ra)  =  EQ^m  pr(T2^_1=x,T2i=y|D0=0,D2^_2,D2i,Dn=m) 
where  EQ  denotes  expectation  conditional  on  DQ  =  0,  Dn  =  m.  By  the  Markov  property 
of  DQ,  Dj,  ...  it  follows  that  pr(T2il=x,T2i*y| Dg*0,Dn*m)  =  EQ ^  pr(T2il=x, 
T2isylD2i  2,D2i^*  To  con,Plete  t^ie  proof  it  suffices  to  show  that,  for  j  and  k  even, 

Pr(T2i-l3X’T2iayl D2i-2^*D2i=k)  *  Pr^T2i-lay»T2iaxl °2i-2^ >D2i*k)  * 

By  the  homogeneity  of  Dq,  D^,  ...  this  reduces  to  showing  that,  for  j  and  k  even, 
pr(Tj»x,T2»y|  DQ-j  ,D2*k)  *  pr(T1»y,T2*x|  DQ«j  ,D2»k) . 
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Using  Lemma  6.2(i),  part  (i)  of  Theorem  2.1  follows. 

To  prove  (ii),  note  that  pr(T2i=x,T2i+1=y| DQ=0,Dn=m)  =  EQ^m  pr(T2i=x, 
T2i+1=y|D0=0,D2i_1,D2i+1,Dn=m) ,  which  by  the  Markov  property  of  DQ,  ,  ...  becomes 

Pr(T2ia!X»T2i+l=ylD0=0,Dna!m^  =  E0,m  pr^T2isx,T2i+l=yl D2i-l’D2i+l^ *  To  comPlete  the 
proof  of  (ii)  it  remains  to  show  that  8  =  EQ  mtPr(T2i=x»T2i+i*ylD2i-l,D2i+l^ 

-  Pr(T2i=y,T2i+l=xl D2i-l,D2i+l^  3  0*  By  expanding  the  expectation  and  noting  that 
°2i  1  °2i+l  take  integer  values  only,  we  have 

^  ~  1  I  PrCD2i_iss3»D2i+i:sklDo=0»Dn=n1^  *  ^Pr^T2i=x,T2i+l=y^D2i-l=^  ,D2i+l=k-) 

3  k 

"  Pr^T2i:sy,T2i+l=sxl  D2i-1*^  *D2i+l“k)  * 

•ll  {Pr(D2i-l=3.D2i+i=k!Do=0,Dns'“)  0(x*y*j»ki  +  Pr^D2i-l=~^  ,D2i+l3_k 
j>0  k>0  ^ 

I Do=0,Dn=m^  aCx»y»-j ,-k> } 

*  l  l  {PrCD2i.1*j,D2i*rklDOs0»Dn=m)  afx»y»j»k^  +  Pr(D2i-r-j»D2i+r“k 

j>0  k<0 

lD0=°,Dn=m)  a(x,y,-j  ,-k) } 

the  second  equality  obtained  using  the  homogeneity  of  DQ,  D^,  ...  .  By  Lemma 
6.2(ii)  the  second  double  summation  equals  0,  and  now  letting  Dn  =  m  =  0  and  by 
Lemma  6.1  and  Corollary  6.1,  we  have 

l  pr(D?.  ,=j,D?.  1=k|DQ=0,D  =0){o(x,y,j,k)  ♦  a(x,y,-j ,-k)} 
j>0  k>0  ^  1  n 

s  I  I  pr(D_.  T=j ,D..  .=k| DQ=0,D  =0) (a(x,y, j ,k)  +  a(l-x,l-y,j ,k) } 
j>0  k>0  1  v  n 


where  we  used  Lemma  6.2(iii).  But  since  ot(x,y,j,k)  +  o(l-x,l-y, j ,k)  =  0  for  every 
x,  y  «  {0,1}  and  j  and  k  odd,  part  (ii)  of  Theorem  2.1  is  proved.  || 


n 


m. 


n 


Proof  of  Corollary  2.1. 
Then,  by  letting  *  {1, 


Let  (tj,...,^)  with  tj  €  {0,1}  and  2 
. . . ,2i-2,2i+l, . . . ,n). 


pr{ 


n  CTisti)lD0=0'Dn=n,)}  =  Pr{  n  I Do=0,Dn=n^Pr^T2i-l=t2i-l,T2i=t2il D0=0» 

j  =  l  J  J  jeJj. 


n  (Tj*tj) ,Dn=m} . 


jeJi 


By  the  Markov  property  of  Dq,D^,  ...  and  using  the  result  in  the  proof  of  Theorem 
2.1  which  states  that  pr(T2i.1!Sx,T2i=y|D2i_2=j ,D2i=k)  =  pr(T2i_1=y,T2i=x| D2i_2=j ,D2i=k) 
for  every  j  and  k  even,  it  follows  that 


PrCT2i-l't2i-l"T2i=t2ilV0*.nI  <W'Vm)  *  Pr(T2i-rt2i-T2i=t2i-llD0'0- 

n  (T.=t.),Dn=m), 
jeJi 


hence 


Proof  of  Theorem  2.2:  Define 

e  =  pr(T2i+1=l|D0=0,Dn=m)  -  pr(T2i=l |D0=0,Dn=m) . 

Then,  by  the  Markov  property  of  DQ,  D^,  ...  we  have 

8  *  E0,m(Pr‘T2i.l-1lD2i-l>D2i.l)  *  Pr'T2i-l|D2i-l-D2i*l)}- 

Using  the  homogeneity  of  DQ,  Dj,  ...  and  (2.1)  we  obtain 


Pr(T2i*l*1lD2i-l*j»D2i+l*j)  '  PrCT2i=1lD 


*  pr(T2«l|D0=j,D2=j)  -  prCTj-llDQ-j.Dj-j) 


0 

C  j  (p%  -  pq) 
cj  Cpq  -  p^) 
.  0 


if  j  =  3,  5,  . . . 
if  j  =  1 
if  j  s  *1 

if  j  =  -3,  -5,  ... 


2  -1 

where  C.  =  (P.  .)  .  By  Corollary  6.1  note  that  C.  *  C  ..  Since  the  relevant 

)  j  tj  i  -l 

values  of  D2i_j  in  (6.1)  are  odd  integers,  and  noting  that  (6.1)  equals  zero 
whenever  D2il  *  °2i+l*  At  follows  that 


B  *  C1(pJs-pq){pr(D2i_1«l,D2i+1=l|D0=0,Dn=m) 

-  Pr C °2i- 1*- 1 » °2i+ 1=‘ 1 1 D0=0 » Dn=m) } * 


By  an  application  of  Lemma  6.1,  we  obtain 


8  •  CltP^-PqH',S,n)'1pr(D21.1»l,D2lil=l|D0.0)(P^i-^;8‘} 


It  follows  that  0(2,  =  ,is)0  according  to  whether  m(>,=,<)0  respectively. 


Proof  of  Theorem  3.1:  Using  (3.1)  we  have 


Vi, a  =  PrtTi-l.D,-)  -  PnVl.Vi.r")  *  V7‘ 

I  ,m 

which  proves  (i) .  Similarly, 


n-i+1 


ffmE^TiTi+j  lDn=m^  =  Pr(Tisl>Ti+j  =  1*Dn!sm)  3  Pr(Ti=1  »Ti+jsl»Dn-i+rm^ 
=  irmE^TlTl+j  I  ^n-i+l3"^ 
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so  that 


a 


Since  the  proofs 
latter. 


n 

ii+j  ,m 


E(T.T.  . 
v  1  1+3 


n  n 
u .  u  .  . 

*i,m*i+j  ,m 


_  .  n-i+1  n-i+1  n-i+1 

TlTl+j ^ Dn-i+l"m^  "  ul,m  wl+j,m~  all+j,m 


of  Theorems  3.2  and  3.3  are  similar,  we  present  only  the 


Proof  of  Theorem  3.3:  By  the  assumed  stationarity, 

"mEtTlTHjlDnmn)  *  Pr'Tl*1"Tl.r1'Dn=") 

*  Vlpr(1i’1’Tl*j"1*Dn-l’m"1)  * 

■  VlVlE(TlTl.)IVl-«  *  «-vn.1)ViE0'iTujlD„-i'"'*1)- 

The  recursion  equation  is  then  obtained  by  letting  *  ir^CTjTj^^  |Dk=m) .  The 
initial  and  boundary  conditions  follow  from 

CJ  -  Pr(Ti.l,Twj.l,Du..») 

■  pr(Dj™-l)pr(T1-l|D^m-l)pr(T1.J»l,D1).j=lii|DJ.m-l)  =  U 
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